Abstract. We show that each level of the quantifier alternation hierarchy
within $FO^2[<]$ on words is a variety of languages. We use the notion of
condensed rankers, a refinement of the rankers defined by Weis and Immerman,
to produce a decidable hierarchy of varieties which is interwoven with the
quantifier alternation hierarchy, and conjecturally equal to it. It follows
that the latter hierarchy is decidable within one unit, a much more precise
result than what is known about the quantifier alternation hierarchy within
$FO[<]$, where no decidability result is known beyond the very first levels.



First-order logic is an important object of study in connection with computer 
science and language theory, not least because many important and natural 
problems are first-order definable: our understanding of the expressive power of 
this logic and the efficiency of the solution of related algorithmic problems are 
of direct interest in such fields as verification. Here, by first-order logic, we mean 
the first-order logic of the linear order, $FO[<]$, interpreted on finite words.

In this context, there has been continued interest in fragments of first-order
logic, defined by the limitation of certain resources, e.g. the quantifier
alternation hierarchy (which is closely related with the dot-depth hierarchy of
star-free languages). It is still an open problem whether each level of this
hierarchy is decidable.¹ Another natural restriction concerns the number of
variables used (and re-used!) in a formula. It is interesting, notably because
the trade-off between formula size and number of variables is known to be
related with the trade-off between parallel time and number of processes, see
[18,5,1,4].

In this paper, we concentrate on $FO^2[<]$, the 2-variable fragment of $FO[<]$.
It is well-known that every $FO[<]$-formula is logically equivalent with a formula
using only 3 variables, but that $FO^2[<]$ is properly less expressive than $FO[<]$.
The expressive power of $FO^2[<]$ was characterized in many interesting fashions
(see [12,14,15,3]), and in particular, we know how to decide whether an
$FO[<]$-formula is equivalent to one in $FO^2[<]$.



* Both authors acknowledge support from the ANR project DOTS, the ESF program
AutoMathA and the Indo-French P2R project modiste-COVER.

¹ On the other hand, the quantifier alternation hierarchy collapses at level 2 for the
first-order logic of the successor $FO[S]$ [16,9].



A recent result of Weis and Immerman refined a result of Schwentick, Thérien
and Vollmer [12] to give a combinatorial description of the $FO^2_m[<]$-definable
languages (those that can be defined by an $FO^2[<]$-formula with quantifier
alternation bounded above by $m$), using the notion of rankers. Rankers are finite
sequences of instructions of the form "go to the next a-position to the right (resp.
left) of the current position".

Our first set of results shows that $\mathcal{FO}^2_m$ (the $FO^2_m[<]$-definable languages),
and the classes of languages defined by rankers having $m$ alternations of
directions (right vs. left), are varieties of languages. This means that membership of
a language $L$ in these classes depends only on the syntactic monoid of $L$, which
justifies an algebraic approach to decidability.

Our investigation shows that rankers are actually better suited to characterize
a natural hierarchy within unary temporal logic, and we introduce the
new notion of a condensed ranker, which is better adapted to discussing the
quantifier alternation hierarchy within $FO^2[<]$. There again, the alternation of
directions in rankers defines hierarchies of varieties of languages $\mathcal{R}_m$ and
$\mathcal{L}_m$, with particularly interesting properties. Indeed, we show that these
varieties are decidable, that they admit a neat characterization in terms of closure
under deterministic and co-deterministic products, and that
$\mathcal{R}_m \cup \mathcal{L}_m \subseteq \mathcal{FO}^2_m \subseteq \mathcal{R}_{m+1} \cap \mathcal{L}_{m+1}$.
The latter containments show that we can effectively compute, given a language
$L \in \mathcal{FO}^2$, an integer $m$ such that $L$ is in $\mathcal{FO}^2_{m+1}$, possibly in
$\mathcal{FO}^2_m$, but not in $\mathcal{FO}^2_{m-1}$. This is much more precise than the current
level of knowledge on the general quantifier alternation hierarchy in $FO[<]$.²

1 An algebraic approach to study $\mathcal{FO}^2_m$

If $u \in A^+$ is a non-empty word, we denote by $u[i]$ the letter of $u$ in position $i$
($1 \le i \le |u|$), and by $u[i,j]$ the factor $u[i] \cdots u[j]$ of $u$ ($1 \le i \le j \le |u|$). Then
we identify the word $u$ with the logical structure $(\{1, \ldots, |u|\}, (\mathbf{a})_{a \in A})$,
where $\mathbf{a}$ denotes the set of integers $i$ such that $u[i] = a$.

Let $FO[<]$ (resp. $FO^k[<]$, $k \ge 0$) denote the set of first-order formulas using
the unary predicates $\mathbf{a}$ ($a \in A$) and the binary predicate $<$ (resp. and at most
$k$ variable symbols). It is well-known that $FO^3[<]$ is as expressive as $FO[<]$ and
that $FO^2[<]$ is properly less expressive.

In the sequel, we omit specifying the predicate $<$ and we write simply $FO$ or
$FO^k$. The classes of $FO$- and $FO^2$-definable languages have well-known beautiful
characterizations [12,14,15,3]. Two are of particular interest in this paper.

- The algebraic characterization in terms of recognizing monoids: a language
is $FO$-definable if and only if it is recognized by a finite aperiodic monoid, i.e., one
in which $x^n = x^{n+1}$ for each element $x$ and for all $n$ large enough (Schützenberger
and McNaughton-Ladner, see [13]); and a language is $FO^2$-definable if and only
if it is recognized by a finite monoid in DA (see [14]), a class of monoids with
many interesting characterizations, which will be discussed later. These algebraic
characterizations prove the decidability of the corresponding classes of languages:
$L$ is $FO$ (resp. $FO^2$) definable if and only if the (effectively computable) syntactic
monoid of $L$ is in the (decidable) class of aperiodic monoids (resp. in DA).

² Unfortunately, it does not help with the general problem since a language $L$ is
$FO^2[<]$-definable if and only if $L$ and its complement are $\Sigma_2$-definable [11].
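The decision procedure for aperiodicity can be illustrated by a minimal sketch. It relies on the standard fact that the syntactic monoid of a regular language is the transition monoid of its minimal automaton. The encoding of transformations as tuples, the function names and the two example automata (for $(ab)^*$ and $(aa)^*$) are assumptions made for this illustration; they are not taken from the paper.

```python
def compose(x, y):
    """Transformation 'x then y' on states 0..n-1 (tuples of images)."""
    return tuple(y[q] for q in x)


def transition_monoid(generators):
    """Close the letter transformations under composition (identity included)."""
    identity = tuple(range(len(generators[0])))
    elements = {identity}
    frontier = [identity]
    while frontier:
        x = frontier.pop()
        for g in generators:
            y = compose(x, g)
            if y not in elements:
                elements.add(y)
                frontier.append(y)
    return elements


def is_aperiodic(monoid):
    """Check that every element x satisfies x^k = x^(k+1) for some k."""
    for x in monoid:
        power = x
        for _ in range(len(monoid)):
            nxt = compose(power, x)
            if nxt == power:
                break
            power = nxt
        else:
            return False  # the powers of x enter a cycle of length > 1
    return True


# Minimal DFA of (ab)*: state 0 is initial and final, state 2 is a sink.
AB_STAR = [(1, 2, 2), (2, 0, 2)]   # images of the states under a and under b
# Minimal DFA of (aa)*: the letter a swaps the two states.
AA_STAR = [(1, 0)]

print(is_aperiodic(transition_monoid(AB_STAR)))  # (ab)* is star-free
print(is_aperiodic(transition_monoid(AA_STAR)))  # (aa)* is not
```

The search is exhaustive and exponential in the number of states in the worst case; it is only meant to make the statement "the class of aperiodic monoids is decidable" concrete.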

- The language-theoretic characterization: a language is $FO$-definable if
and only if it is star-free, i.e., it can be obtained from singletons using Boolean
operations and concatenation products (Schützenberger, see [8]); a language is
$FO^2$-definable if and only if it can be written as the disjoint union of unambiguous
products of the form $B_0^* a_1 B_1^* \cdots a_k B_k^*$, where $k \ge 0$, the $a_i$ are letters and
the $B_i$ are subsets of the alphabet. Such a product is called unambiguous if
each word $u \in B_0^* a_1 B_1^* \cdots a_k B_k^*$ admits a unique factorization in the form
$u = u_0 a_1 u_1 \cdots a_k u_k$ such that $u_i \in B_i^*$ for each $i$.

We now concentrate on $FO^2$-formulas and we define two important parameters
concerning such formulas. To simplify matters, we consider only formulas
where negation is used only on atomic formulas so that, in particular, no
quantifier is negated. This is naturally possible up to logical equivalence. Now, with
each formula $\varphi \in FO^2$, we associate in the natural way a parsing tree: each
occurrence of a quantification, $\exists x$ or $\forall x$, yields a unary node, each occurrence of
$\vee$ or $\wedge$ yields a binary node, and the leaves are labeled with atomic or negated
atomic formulas. Each path from root to leaf in this parsing tree has a quantifier
label, which is the sequence of quantifier node labels ($\exists$ or $\forall$) encountered along
this path. A block in this quantifier label is a maximal factor consisting only of
$\exists$ or only of $\forall$. The quantifier depth of $\varphi$ is the maximum length of the quantifier
label of a path in the parsing tree of $\varphi$, and the number of blocks of $\varphi$ is the
maximum number of blocks in the quantifier label of a path in its parsing tree.
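The two parameters just defined can be computed by a direct traversal of the parsing tree. The sketch below assumes a hypothetical encoding of formulas as nested tuples (it is not a format used in the paper), and writes E and A for the two quantifier labels.

```python
from itertools import groupby

# Hypothetical encoding of a formula in negation normal form:
#   ("atom", text), ("exists", var, sub), ("forall", var, sub),
#   ("and", left, right), ("or", left, right)


def quantifier_labels(formula):
    """Yield the quantifier label (a string over E/A) of each root-to-leaf path."""
    kind = formula[0]
    if kind == "atom":
        yield ""
    elif kind in ("exists", "forall"):
        symbol = "E" if kind == "exists" else "A"
        for label in quantifier_labels(formula[2]):
            yield symbol + label
    else:  # binary node: "and" / "or"
        for child in formula[1:]:
            yield from quantifier_labels(child)


def quantifier_depth(formula):
    """Maximum length of a quantifier label."""
    return max(len(label) for label in quantifier_labels(formula))


def number_of_blocks(formula):
    """Maximum number of maximal runs of E or of A in a quantifier label."""
    return max(sum(1 for _ in groupby(label))
               for label in quantifier_labels(formula))


# Ex (a(x) and Ey (x < y and Ax (not y < x or b(x)))), using two variables only.
example = ("exists", "x", ("and", ("atom", "a(x)"),
           ("exists", "y", ("and", ("atom", "x < y"),
            ("forall", "x", ("or", ("atom", "not y < x"),
                             ("atom", "b(x)")))))))

print(quantifier_depth(example), number_of_blocks(example))
```

On this example the longest quantifier label is EEA, so the formula has quantifier depth 3 and 2 blocks.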

We let $FO^2_{m,n}$ denote the set of first-order formulas with quantifier depth at
most $n$ and with at most $m$ blocks and let $FO^2_m$ denote the union of the $FO^2_{m,n}$ for
all $n$. We also denote by $\mathcal{FO}^2$ ($\mathcal{FO}^2_m$) the class of $FO^2$ ($FO^2_m$)-definable
languages. Weis and Immerman's characterization of the expressive power of
$FO^2_{m,n}$ [18] in terms of rankers, see Theorem 1.2 below, forms the basis of our
own results.

1.1 Rankers and logic 

A ranker [18] is a non-empty word on the alphabet $\{X_a, Y_a \mid a \in A\}$.³ Rankers
may define positions in words: given a word $u \in A^+$ and a letter $a \in A$, we
denote by $X_a(u)$ (resp. $Y_a(u)$) the least (resp. greatest) integer $1 \le i \le |u|$ such
that $u[i] = a$. If $a$ does not occur in $u$, we say that $Y_a(u)$ and $X_a(u)$ are not
defined. If in addition $q$ is an integer such that $0 \le q \le |u| + 1$, we let

$$X_a(u,q) = q + X_a(u[q+1, |u|]),$$
$$Y_a(u,q) = Y_a(u[1, q-1]).$$



³ Weis and Immerman write $\triangleright_a$ and $\triangleleft_a$ instead of $X_a$ and $Y_a$. We rather follow the
notation in [3], where X and Y refer to the future and past operators of LTL.

These definitions are extended to all rankers: if $r'$ is a ranker, $Z \in \{X_a, Y_a \mid a \in A\}$
and $r = r'Z$, we let $r(u,q) = Z(u, r'(u,q))$ if $r'(u,q)$ and $Z(u, r'(u,q))$ are
defined, and we say that $r(u,q)$ is undefined otherwise.

Finally, if $r$ starts with an X- (resp. Y-) letter, we say that $r$ defines the
position $r(u) = r(u,0)$ (resp. $r(u) = r(u, |u|+1)$), or that it is undefined on $u$ if
this position does not exist. Then $L(r)$ is the language of all words on which $r$ is
defined. We say that the words $u$ and $v$ agree on a class $R$ of rankers if exactly
the same rankers from $R$ are defined on $u$ and $v$.
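The evaluation of a ranker on a word can be sketched as follows. The representation of a ranker as a list of (direction, letter) pairs and the function names are assumptions made for this illustration; positions are 1-based, as in the text, and `None` stands for "undefined".

```python
def step(u, direction, letter, q):
    """Position reached from q by X_letter (to the right) or Y_letter (to the left).

    q ranges over 0..len(u)+1; the result is None if no such position exists.
    """
    if direction == "X":
        candidates = range(q + 1, len(u) + 1)
    else:
        candidates = range(q - 1, 0, -1)
    for i in candidates:
        if u[i - 1] == letter:
            return i
    return None


def evaluate(ranker, u):
    """Position r(u) defined by the ranker on u, or None if r is undefined on u."""
    q = 0 if ranker[0][0] == "X" else len(u) + 1
    for direction, letter in ranker:
        q = step(u, direction, letter, q)
        if q is None:
            return None
    return q


def agree(u, v, rankers):
    """True if exactly the same rankers of the given collection are defined on u and v."""
    return all((evaluate(r, u) is None) == (evaluate(r, v) is None)
               for r in rankers)


r = [("X", "a"), ("Y", "b"), ("X", "c")]   # the ranker X_a Y_b X_c
print(evaluate(r, "bac"), evaluate(r, "bca"))
```

On `bac` the ranker visits positions 2, 1, 3 and on `bca` it visits positions 3, 1, 2, so it is defined on both words.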

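The depth of a ranker $r$ is defined to be its length (as a word). A block in
$r$ is a maximal factor in $\{X_a \mid a \in A\}^+$ (an X-block) or in $\{Y_a \mid a \in A\}^+$ (a
Y-block). If $n \ge m$, we denote by $R^X_{m,n}$ (resp. $R^Y_{m,n}$) the set of $m$-block, depth
$n$ rankers, starting with an X- (resp. Y-) block, and we let $R_{m,n} = R^X_{m,n} \cup R^Y_{m,n}$
and $R^X_{\underline{m},n} = \bigcup_{n' \le n} R^X_{m,n'} \cup \bigcup_{m' < m,\, n' < n} R_{m',n'}$. We define
$R^Y_{\underline{m},n}$ dually and we let $R^X_{\underline{m}} = \bigcup_{n \ge m} R^X_{\underline{m},n}$,
$R^Y_{\underline{m}} = \bigcup_{n \ge m} R^Y_{\underline{m},n}$ and $R_{\underline{m}} = R^X_{\underline{m}} \cup R^Y_{\underline{m}}$.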

Rankers and temporal logic. Let us depart for a moment from the consideration
of $FO^2$-formulas, to observe that rankers are naturally suited to describe
the different levels of a natural class of temporal logic. The symbols $X_a$ and
$Y_a$ ($a \in A$) can be seen as modal (temporal) operators, with the future and
past semantics respectively. We denote the resulting temporal logic (known as
unary temporal logic) by TL: its only atomic formula is $\top$, the other formulas
are built using Boolean connectives and modal operators. Let $u \in A^*$ and let
$0 \le i \le |u| + 1$. We say that $\top$ holds at every position $i$, $(u,i) \models \top$; Boolean
connectives are interpreted as usual; and $(u,i) \models X_a\varphi$ (resp. $Y_a\varphi$) if and only if
$(u,j) \models \varphi$, where $j$ is the least a-position such that $i < j$ (resp. the greatest
a-position such that $j < i$). We also say that $u \models X_a\varphi$ (resp. $Y_a\varphi$) if
$(u,0) \models X_a\varphi$ (resp. $(u, 1 + |u|) \models Y_a\varphi$).

TL is a fragment of propositional temporal logic PTL; the latter is expressively
equivalent to $FO$ and TL is expressively equivalent to $FO^2$, see [14].

As in the case of $FO^2$-formulas, one may consider the parsing tree of a TL-formula
and define inductively its depth and number of alternations (between
past and future operators). If $n \ge m$, the fragment $TL^X_{m,n}$ (resp. $TL^Y_{m,n}$) consists
of the TL-formulas with depth $n$ and with $m$ alternations, in which every branch
(of the parsing tree) with exactly $m$ alternations starts with future (resp. past)
operators. The fragments $TL_{m,n}$, $TL^X_{\underline{m},n}$, $TL^Y_{\underline{m},n}$, $TL^X_{\underline{m}}$,
$TL^Y_{\underline{m}}$ and $TL_{\underline{m}}$ are defined according to the same pattern as in the
definition of $R_{m,n}$, $R^X_{\underline{m},n}$, $R^Y_{\underline{m},n}$, $R^X_{\underline{m}}$, $R^Y_{\underline{m}}$
and $R_{\underline{m}}$. We also denote by $\mathcal{TL}^X_{m,n}$ ($\mathcal{TL}^X_{\underline{m}}$,
$\mathcal{TL}_{\underline{m}}$, etc.) the class of $TL^X_{m,n}$ ($TL^X_{\underline{m}}$, $TL_{\underline{m}}$,
etc.)-definable languages. The following result is elementary.

Proposition 1.1. Let $1 \le m \le n$. Two words satisfy the same $TL^X_{\underline{m},n}$ formulas
if and only if they agree on rankers from $R^X_{\underline{m},n}$. A language is in
$\mathcal{TL}^X_{\underline{m},n}$ if and only if it is a Boolean combination of languages of the form
$L(r)$, $r \in R^X_{\underline{m},n}$.

Similar statements hold for $TL^Y_{\underline{m},n}$, $TL_{m,n}$, $TL^X_{m,n}$, $TL^Y_{m,n}$,
$TL^X_{\underline{m}}$, $TL^Y_{\underline{m}}$ and $TL_{\underline{m}}$, relative to the corresponding classes of rankers.

Rankers and $FO^2$. The connection established by Weis and Immerman [18]
between rankers and formulas in $FO^2_{m,n}$, Theorem 1.2 below, is deeper. If $x, y$
are integers, we let $\mathrm{ord}(x,y)$, the order type of $x$ and $y$, be one of the symbols
$<$, $>$ or $=$, depending on whether $x < y$, $x > y$ or $x = y$.

Theorem 1.2. Let $u, v \in A^*$ and let $1 \le m \le n$. Then $u$ and $v$ satisfy the same
formulas in $FO^2_{m,n}$ if and only if

(WI 1) $u$ and $v$ agree on rankers from $R_{m,n}$,

(WI 2) if the rankers $r \in R_{m,n}$ and $r' \in R_{m-1,n-1}$ are defined on $u$ and $v$, then
$\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$,

(WI 3) if $r \in R_{m,n}$ and $r' \in R_{m,n-1}$ are defined on $u$ and $v$ and end with
different direction letters, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

Corollary 1.3. For each $n \ge m \ge 1$, $\mathcal{TL}_{m,n} \subseteq \mathcal{FO}^2_{m,n}$ and
$\mathcal{TL}_{\underline{m}} \subseteq \mathcal{FO}^2_m$.

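$FO^2_m$- and $TL_{\underline{m}}$-definable languages form varieties. Our first result is the
following. We refer the reader to Section 1.2 below, and to the references cited
there, for background and discussion on varieties of languages.

Proposition 1.4. For each $n \ge m \ge 1$, the classes $\mathcal{TL}^X_{\underline{m},n}$,
$\mathcal{TL}^Y_{\underline{m},n}$, $\mathcal{TL}^X_{\underline{m}}$, $\mathcal{TL}^Y_{\underline{m}}$, $\mathcal{TL}_{m,n}$,
$\mathcal{TL}_{\underline{m}}$, $\mathcal{FO}^2_{m,n}$ and $\mathcal{FO}^2_m$ are varieties of languages.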

Sketch of proof. Let $\rho_{m,n}$ be the relation for two words to agree on
$TL^X_{\underline{m},n}$-formulas. Using Proposition 1.1, one verifies that $\rho_{m,n}$ is a finite
index congruence. Then a language is $TL^X_{\underline{m},n}$-definable if and only if it is a
union of $\rho_{m,n}$-classes, if and only if it is recognized by the finite monoid
$A^*/\rho_{m,n}$. It follows that these languages are exactly those accepted by the
monoids in the pseudovariety generated by the $A^*/\rho_{m,n}$, for all finite alphabets
$A$, and hence they form a variety of languages.

The proof for the other fragments of TL is similar. For the fragments of $FO^2$,
we use Theorem 1.2 instead of Proposition 1.1. □

This result shows that, for a given regular language $L$, $TL^X_{\underline{m}}$- (resp.
$TL^Y_{\underline{m}}$-, $FO^2_m$-, etc.) definability is characterized algebraically, that is, it
depends only on the syntactic monoid of $L$. This justifies using the algebraic path
to tackle decidability of these definability problems. Eilenberg's theory of varieties
provides the mathematical framework.

1.2 A short survey on varieties and pseudovarieties 

We summarize in this section the information on monoid and variety theory that
will be relevant for our purpose, see [8,2,14,15] for more details.

A language $L \subseteq A^*$ is recognized by a monoid $M$ if there exists a morphism
$\varphi\colon A^* \to M$ such that $L = \varphi^{-1}(\varphi(L))$. For instance, if $u \in A^*$ and $B \subseteq A$, let
$\mathrm{alph}(u) = \{a \in A \mid u = vaw \text{ for some } v, w \in A^*\}$ and
$[B] = \{u \in A^* \mid \mathrm{alph}(u) = B\}$. Then $[B]$ is recognized by the direct product of
$|A|$ copies of the 2-element monoid $\{0, 1\}$ (multiplicative).
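The recognition of $[B]$ by a product of copies of $\{0,1\}$ can be made concrete with a short sketch. It assumes one copy of $\{0,1\}$ per letter of the alphabet, with the letter $a$ mapped to the vector that is 0 in coordinate $a$ and 1 elsewhere; the function names are hypothetical.

```python
def alph(u):
    """Set of letters occurring in the word u."""
    return frozenset(u)


def phi(u, alphabet):
    """Image of u in the product of len(alphabet) copies of ({0, 1}, *).

    Coordinate a of the result is 0 if and only if the letter a occurs in u.
    """
    image = [1] * len(alphabet)
    for letter in u:
        generator = [0 if a == letter else 1 for a in alphabet]
        image = [x * y for x, y in zip(image, generator)]
    return tuple(image)


def in_class(u, B, alphabet):
    """Membership of u in [B], decided from phi(u) alone."""
    accepting = tuple(0 if a in B else 1 for a in alphabet)
    return phi(u, alphabet) == accepting


print(phi("abba", "abc"), in_class("abba", {"a", "b"}, "abc"))
```

Since membership in $[B]$ is read off from the image of the word, $[B]$ is the inverse image of a single element of the product monoid.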

A pseudovariety of monoids is a class of finite monoids closed under taking
direct products, homomorphic images and submonoids. Pseudovarieties of
semigroups are defined similarly. A class of languages $\mathcal{V}$ is a collection
$\mathcal{V} = (\mathcal{V}(A))_A$, indexed by all finite alphabets $A$, such that $\mathcal{V}(A)$ is a set of
languages in $A^*$. If $\mathbf{V}$ is a pseudovariety of monoids, we let $\mathcal{V}(A)$ be the set of
languages of $A^*$ recognized by a monoid in $\mathbf{V}$. The class $\mathcal{V}$ is closed under
Boolean operations, residuals and inverse homomorphic images. Classes of
recognizable languages with these properties are called varieties of languages, and
Eilenberg's theorem (see [8]) states that the correspondence $\mathbf{V} \mapsto \mathcal{V}$, from
pseudovarieties of monoids to varieties of languages, is one-to-one and onto.
Moreover, the decidability of membership in the pseudovariety $\mathbf{V}$ implies the
decidability of the variety $\mathcal{V}$: indeed, a language is in $\mathcal{V}$ if and only if its
(effectively computable) syntactic monoid is in $\mathbf{V}$.

For every finite semigroup $S$ and $s \in S$, we denote by $s^\omega$ the unique power
of $s$ which is idempotent. The Green relations are another important concept
to describe monoids: if $S$ is a monoid and $s, t \in S$, we say that $s \le_{\mathcal{J}} t$ (resp.
$s \le_{\mathcal{R}} t$, $s \le_{\mathcal{L}} t$) if $s = utv$ (resp. $s = tv$, $s = ut$) for some $u, v \in S$. We also say
that $s \mathrel{\mathcal{J}} t$ if $s \le_{\mathcal{J}} t$ and $t \le_{\mathcal{J}} s$. The relations $\mathcal{R}$ and $\mathcal{L}$ are
defined similarly.

Pseudovarieties that will be important in this paper are the following. 

- $\mathbf{J}_1$, the pseudovariety of idempotent and commutative monoids, whose
corresponding variety of languages consists of the Boolean combinations of
languages of the form $[B]$.

- $\mathbf{R}$, $\mathbf{L}$ and $\mathbf{J}$, the pseudovarieties of $\mathcal{R}$-, $\mathcal{L}$- and
$\mathcal{J}$-trivial monoids; a monoid is, say, $\mathcal{R}$-trivial if each of its $\mathcal{R}$-classes is
a singleton.

- DA, the pseudovariety of all monoids in which $(xy)^\omega x (xy)^\omega = (xy)^\omega$ for
all $x, y$; DA has a great many characterizations in combinatorial, algebraic and
logical terms [2,11,12,14,15].

- $\mathbf{K}$ (resp. $\mathbf{D}$, $\mathbf{LI}$) is the pseudovariety of semigroups in which
$x^\omega y = x^\omega$ (resp. $y x^\omega = x^\omega$, $x^\omega y x^\omega = x^\omega$) for all $x, y$.
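The idempotent power $s^\omega$ and the defining identity of DA can be checked by brute force on a small transformation monoid. The following sketch makes the same assumptions as any tuple-based encoding of transformations (states numbered from 0, products read left to right); the helper names and the two example automata are hypothetical.

```python
def compose(x, y):
    """Transformation 'x then y' on states 0..n-1 (tuples of images)."""
    return tuple(y[q] for q in x)


def transition_monoid(generators):
    """Close the letter transformations under composition (identity included)."""
    identity = tuple(range(len(generators[0])))
    elements, frontier = {identity}, [identity]
    while frontier:
        x = frontier.pop()
        for g in generators:
            y = compose(x, g)
            if y not in elements:
                elements.add(y)
                frontier.append(y)
    return elements


def omega(x):
    """The unique idempotent power of x."""
    power = x
    while compose(power, power) != power:
        power = compose(power, x)
    return power


def satisfies_da_identity(monoid):
    """Check (xy)^omega x (xy)^omega = (xy)^omega for all x, y in the monoid."""
    for x in monoid:
        for y in monoid:
            e = omega(compose(x, y))
            if compose(compose(e, x), e) != e:
                return False
    return True


CONTAINS_A = [(1, 1)]              # minimal DFA of "words containing a" over {a}
AB_STAR = [(1, 2, 2), (2, 0, 2)]   # minimal DFA of (ab)*, state 2 is a sink

print(satisfies_da_identity(transition_monoid(CONTAINS_A)))
print(satisfies_da_identity(transition_monoid(AB_STAR)))
```

The second monoid is aperiodic but fails the identity (take $x = a$, $y = b$), which is consistent with $(ab)^*$ being $FO$-definable but not $FO^2$-definable.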

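Finally, if $\mathbf{V}$ is a pseudovariety of semigroups and $\mathbf{W}$ is a pseudovariety of
monoids, we say that a finite monoid $M$ lies in the Mal'cev product $\mathbf{V} ⓜ \mathbf{W}$ if
there exist a finite monoid $T$ and onto morphisms $\alpha\colon T \to M$ and $\beta\colon T \to N$
such that $N \in \mathbf{W}$ and $\beta^{-1}(e) \in \mathbf{V}$ for each idempotent $e$ of $N$. Then
$\mathbf{V} ⓜ \mathbf{W}$ is a pseudovariety of monoids and we have in particular [8,2,10]:

$$\mathbf{K} ⓜ \mathbf{J}_1 = \mathbf{K} ⓜ \mathbf{J} = \mathbf{R}, \quad \mathbf{D} ⓜ \mathbf{J}_1 = \mathbf{D} ⓜ \mathbf{J} = \mathbf{L}, \quad \mathbf{LI} ⓜ \mathbf{J}_1 = \mathbf{LI} ⓜ \mathbf{J} = \mathbf{DA}.$$

We denote by $\mathbf{TL}^X_{\underline{m},n}$, $\mathbf{TL}^Y_{\underline{m},n}$, $\mathbf{TL}^X_{\underline{m}}$,
$\mathbf{TL}^Y_{\underline{m}}$, $\mathbf{TL}_{m,n}$, $\mathbf{TL}_{\underline{m}}$, $\mathbf{FO}^2_{m,n}$ and $\mathbf{FO}^2_m$ the
pseudovarieties corresponding to the language varieties discovered in Proposition 1.4.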

2 Main results 

Our main tool to approach the decidability of $FO^2_m$-definability lies in a variant
of rankers, which we borrow from a proof in Weis and Immerman's paper [18]. As
in the turtle language of [12], a ranker can be seen as a sequence of instructions:
go to the next a to the right, go to the next b to the left, etc. We say that a
ranker $r$ is condensed on $u$ if it is defined on $u$, and if the sequence of positions
visited zooms in on $r(u)$, never crossing over a position already visited. Formally,
$r = Z_1 \cdots Z_n$ is condensed on $u$ if there exists a chain of open intervals

$$(0, |u|+1) = (i_0, j_0) \supset (i_1, j_1) \supset \cdots \supset (i_{n-1}, j_{n-1}) \ni r(u)$$

such that for all $1 \le \ell \le n-1$ the following properties are satisfied:

- If $Z_\ell Z_{\ell+1} = X_a X_b$ then $(i_\ell, j_\ell) = (X_a(u, i_{\ell-1}), j_{\ell-1})$.

- If $Z_\ell Z_{\ell+1} = Y_a Y_b$ then $(i_\ell, j_\ell) = (i_{\ell-1}, Y_a(u, j_{\ell-1}))$.

- If $Z_\ell Z_{\ell+1} = X_a Y_b$ then $(i_\ell, j_\ell) = (i_{\ell-1}, X_a(u, i_{\ell-1}))$.

- If $Z_\ell Z_{\ell+1} = Y_a X_b$ then $(i_\ell, j_\ell) = (Y_a(u, j_{\ell-1}), j_{\ell-1})$.

For instance, the ranker $X_a Y_b X_c$ is defined on the words $bac$ and $bca$, but
it is condensed only on $bca$. Rankers in $R_{\underline{1}}$, or of the form
$X_a Y_{b_1} \cdots Y_{b_k}$ or $Y_a X_{b_1} \cdots X_{b_k}$, are condensed on all words on which
they are defined. We denote by $L_c(r)$ the set of all words on which $r$ is condensed.
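The definition of a condensed ranker can be tested mechanically. The sketch below builds the chain of intervals from the four rules and checks that it is nested and that the last interval contains $r(u)$. It assumes that rankers are given as lists of (direction, letter) pairs and that the containments in the chain are read as non-strict; both choices, and all function names, are assumptions of this illustration.

```python
def step(u, direction, letter, q):
    """Position reached from q by X_letter (rightwards) or Y_letter (leftwards)."""
    if direction == "X":
        candidates = range(q + 1, len(u) + 1)
    else:
        candidates = range(q - 1, 0, -1)
    for i in candidates:
        if u[i - 1] == letter:
            return i
    return None


def evaluate(ranker, u):
    """Position r(u), or None if the ranker is undefined on u."""
    q = 0 if ranker[0][0] == "X" else len(u) + 1
    for direction, letter in ranker:
        q = step(u, direction, letter, q)
        if q is None:
            return None
    return q


def is_condensed(ranker, u):
    """True if the ranker is defined on u and its chain of intervals is nested."""
    target = evaluate(ranker, u)
    if target is None:
        return False
    i, j = 0, len(u) + 1                       # (i_0, j_0)
    for (d1, a), (d2, _) in zip(ranker, ranker[1:]):
        # Position reached by the current letter: X moves start from i, Y moves from j.
        p = step(u, d1, a, i if d1 == "X" else j)
        if p is None:
            return False
        # The visited position becomes the bound from which the next letter departs.
        new_i, new_j = (p, j) if d2 == "X" else (i, p)
        if not (i <= new_i < new_j <= j):
            return False
        i, j = new_i, new_j
    return i < target < j


r = [("X", "a"), ("Y", "b"), ("X", "c")]       # the ranker X_a Y_b X_c
print(is_condensed(r, "bac"), is_condensed(r, "bca"))
```

On `bac` the chain is (0,4), (0,2), (1,2) and the final position 3 lies outside the last interval; on `bca` the chain is (0,4), (0,3), (1,3) and the final position 2 lies inside it, in agreement with the example in the text.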

Condensed rankers form a natural notion, which is equally well-suited to the
task of describing $FO^2_m$-definability (see Theorem 2.4 below). With respect to
TL, for which Proposition 1.1 shows a perfect match with the notion of rankers,
they can be interpreted as adding a strong notion of unambiguity, see Section 3
below and the work of Lodaya, Pandya and Shah [7] on unambiguous interval
temporal logic.



2.1 Condensed rankers determine a hierarchy of pseudovarieties 

Let us say that two words $u$ and $v$ agree on condensed rankers from a set $R$ of
rankers, if the same rankers from $R$ are condensed on $u$ and $v$. We write
$u \triangleright_{m,n} v$ (resp. $u \triangleleft_{m,n} v$) if $u$ and $v$ agree on condensed rankers in
$R^X_{\underline{m},n}$ (resp. $R^Y_{\underline{m},n}$).

These relations turn out to have a very nice recursive characterization. For
each word $u \in A^*$ and letter $a$ occurring in $u$, the a-left (resp. a-right)
factorization of $u$ is the factorization that isolates the leftmost (resp. rightmost)
occurrence of $a$ in $u$; that is, the factorization $u = u_- a u_+$ such that $a$ does not
occur in $u_-$ (resp. $u_+$). We say that the word $a_1 \cdots a_r$ is a subword of $u$ if $u$ can
be factored as $u = u_0 a_1 u_1 \cdots a_r u_r$, with the $u_i \in A^*$.

Proposition 2.1. The relations $\triangleright_{m,n}$ and $\triangleleft_{m,n}$ ($n \ge m \ge 1$) are uniquely
determined by the following properties.

- $u \triangleright_{1,n} v$ if and only if $u \triangleleft_{1,n} v$, if and only if $u$ and $v$ have the same
subwords of length at most $n$.

- If $m > 1$, then $u \triangleright_{m,n} v$ if and only if $\mathrm{alph}(u) = \mathrm{alph}(v)$,
$u \triangleleft_{m-1,n-1} v$ and for each letter $a \in \mathrm{alph}(u)$, the a-left factorizations
$u = u_- a u_+$ and $v = v_- a v_+$ satisfy $u_- \triangleleft_{m-1,n-1} v_-$ and $u_+ \triangleright_{m,n-1} v_+$.

- If $m > 1$, then $u \triangleleft_{m,n} v$ if and only if $\mathrm{alph}(u) = \mathrm{alph}(v)$,
$u \triangleright_{m-1,n-1} v$ and for each letter $a \in \mathrm{alph}(u)$, the a-right factorizations
$u = u_- a u_+$ and $v = v_- a v_+$ satisfy $u_+ \triangleright_{m-1,n-1} v_+$ and $u_- \triangleleft_{m,n-1} v_-$.

Corollary 2.2. The relations $\triangleright_{m,n}$ and $\triangleleft_{m,n}$ are finite-index congruences.

For each $m \ge 1$, let us denote by $\mathbf{R}_m$ (resp. $\mathbf{L}_m$) the pseudovariety generated
by the quotients $A^*/\triangleright_{m,n}$ (resp. $A^*/\triangleleft_{m,n}$), where $n \ge m$ and $A$ is a finite
alphabet. Corollary 2.2 shows that a language $L$ is in the corresponding variety
$\mathcal{R}_m$ (resp. $\mathcal{L}_m$) if and only if $L$ is a Boolean combination of languages of the
form $L_c(r)$, with $r \in R^X_{\underline{m}}$ (resp. $R^Y_{\underline{m}}$).

By definition, for all $m \ge 1$, $\mathbf{R}_m$ and $\mathbf{L}_m$ are contained in both $\mathbf{R}_{m+1}$ and
$\mathbf{L}_{m+1}$. According to the first statement of Proposition 2.1, $\triangleright_{1,n} = \triangleleft_{1,n}$ is
the congruence defining the piecewise $n$-testable languages studied by Simon in the
early 1970s and, in consequence, $\mathbf{R}_1 = \mathbf{L}_1 = \mathbf{J}$, the pseudovariety of
$\mathcal{J}$-trivial monoids [8].
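The base case of Proposition 2.1, together with the a-left and a-right factorizations used in its recursive clauses, can be illustrated as follows. This is only a sketch of the level-1 congruence (same subwords of length at most $n$); it does not implement the higher levels, and the function names are hypothetical.

```python
def subwords(u, n):
    """Set of (scattered) subwords of u of length at most n."""
    found = {""}
    for letter in u:
        found |= {w + letter for w in found if len(w) < n}
    return found


def same_subwords(u, v, n):
    """Level-1 congruence: u and v have the same subwords of length at most n."""
    return subwords(u, n) == subwords(v, n)


def left_factorization(u, a):
    """a-left factorization (u_minus, u_plus): a does not occur in u_minus."""
    i = u.index(a)
    return u[:i], u[i + 1:]


def right_factorization(u, a):
    """a-right factorization (u_minus, u_plus): a does not occur in u_plus."""
    i = u.rindex(a)
    return u[:i], u[i + 1:]


print(same_subwords("abab", "abba", 2), same_subwords("abab", "abba", 3))
print(left_factorization("bcabca", "a"), right_factorization("bcabca", "a"))
```

The words `abab` and `abba` have the same subwords of length at most 2, but `bab` is a subword of the first and not of the second, so they are separated at $n = 3$.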

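In addition, one can show that if a position in a word $u$ is defined by a ranker
$r \in R^X_{\underline{m},n}$ (resp. $R^Y_{\underline{m},n}$), then the same position is defined by a ranker
$s \in R^X_{\underline{m},n}$ (resp. $R^Y_{\underline{m},n}$) which is condensed on $u$. This leads to the
following result.

Proposition 2.3. Let $n \ge m \ge 1$. If the words $u$ and $v$ agree on condensed
rankers in $R^X_{\underline{m},n}$ (resp. $R^Y_{\underline{m},n}$), then they agree on rankers from the same class.
In particular, $\mathbf{TL}^X_{\underline{m}} \subseteq \mathbf{R}_m$ and $\mathbf{TL}^Y_{\underline{m}} \subseteq \mathbf{L}_m$.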

As indicated above, condensed rankers allow for a description of
$FO^2_m$-definability, as neat as with ordinary rankers: more precisely, we show that
the statement of Weis and Immerman's theorem can be modified to use condensed
rankers instead.

Theorem 2.4. Let $u, v \in A^*$ and let $1 \le m \le n$. Then $u$ and $v$ satisfy the same
formulas in $FO^2_{m,n}$ if and only if

(WI 1c) $u$ and $v$ agree on condensed rankers from $R_{m,n}$,

(WI 2c) if the rankers $r \in R_{m,n}$ and $r' \in R_{m-1,n-1}$ are condensed on $u$ and $v$,
then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$,

(WI 3c) if $r \in R_{m,n}$ and $r' \in R_{m,n-1}$ are condensed on $u$ and $v$ and end with
different direction letters, then $\mathrm{ord}(r(u), r'(u)) = \mathrm{ord}(r(v), r'(v))$.

Thus there is a connection between $\mathcal{FO}^2_m$ and the varieties $\mathcal{R}_m$ and $\mathcal{L}_m$. But
much more can be said about the latter varieties.

2.2 Language hierarchies 

Proposition 2.1 also leads to a description of the language varieties $\mathcal{R}_m$ and
$\mathcal{L}_m$ in terms of deterministic and co-deterministic products. Recall that a product
of languages $L = L_0 a_1 L_1 \cdots a_k L_k$ ($k \ge 1$, $a_i \in A$, $L_i \subseteq A^*$) is said to be
deterministic if, for $1 \le i \le k$, each word $u \in L$ has a unique prefix in
$L_0 a_1 L_1 \cdots L_{i-1} a_i$. If for each $i$, the letter $a_i$ does not occur in $L_{i-1}$, the
product $L_0 a_1 L_1 \cdots a_k L_k$ is called visibly deterministic: this is obviously a
particular case of a deterministic product.
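For products whose factors are of the special form $B_i^*$, visible determinism is a simple syntactic condition, and the factorization of a word can then be found by a single left-to-right scan. The sketch below covers only this special case; the list-based encoding of the product and the function names are assumptions of the illustration.

```python
def is_visibly_deterministic(blocks, letters):
    """blocks = [B_0, ..., B_k], letters = [a_1, ..., a_k]; a_i must not be in B_{i-1}."""
    return all(a not in blocks[i] for i, a in enumerate(letters))


def factorize(u, blocks, letters):
    """Factors [u_0, ..., u_k] of u in B_0* a_1 B_1* ... a_k B_k*, or None.

    Assumes the product is visibly deterministic: the marker a_i is then
    necessarily the first letter outside B_{i-1} after the previous marker.
    """
    factors, pos = [], 0
    for block, marker in zip(blocks, letters):
        start = pos
        while pos < len(u) and u[pos] in block:
            pos += 1
        if pos == len(u) or u[pos] != marker:
            return None
        factors.append(u[start:pos])
        pos += 1
    if any(c not in blocks[-1] for c in u[pos:]):
        return None
    factors.append(u[pos:])
    return factors


blocks, letters = [{"b", "c"}, {"a", "b"}], ["a"]      # {b,c}* a {a,b}*
print(is_visibly_deterministic(blocks, letters))
print(factorize("bcab", blocks, letters), factorize("bacb", blocks, letters))
```

Here `bcab` factors as `bc`, `a`, `b`, whereas `bacb` is rejected because the letter `c` occurs after the marker.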

The definition of a co-deterministic or visibly co-deterministic product is
dual, in terms of suffixes instead of prefixes. If $\mathcal{V}$ is a class of languages and $A$ is
a finite alphabet, let $\mathcal{V}^{det}(A)$ (resp. $\mathcal{V}^{vdet}(A)$, $\mathcal{V}^{codet}(A)$,
$\mathcal{V}^{vcodet}(A)$) be the set of all Boolean combinations of languages of $\mathcal{V}(A)$ and
of deterministic (resp. visibly deterministic, co-deterministic, visibly
co-deterministic) products of languages of $\mathcal{V}(A)$. Schützenberger gave algebraic
characterizations of the closure operations $\mathcal{V} \mapsto \mathcal{V}^{det}$ and
$\mathcal{V} \mapsto \mathcal{V}^{codet}$, see [8]: if $\mathcal{V}$ is a variety of languages and if $\mathbf{V}$ is
the corresponding pseudovariety of monoids, then $\mathcal{V}^{det}$ and $\mathcal{V}^{codet}$ are
varieties of languages and the corresponding pseudovarieties are, respectively,
$\mathbf{K} ⓜ \mathbf{V}$ and $\mathbf{D} ⓜ \mathbf{V}$. Then we show the following.

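Proposition 2.5. For each $m \ge 1$, we have $\mathcal{R}_{m+1} = \mathcal{L}_m^{det} = \mathcal{L}_m^{vdet}$,
$\mathbf{R}_{m+1} = \mathbf{K} ⓜ \mathbf{L}_m$, $\mathcal{L}_{m+1} = \mathcal{R}_m^{codet} = \mathcal{R}_m^{vcodet}$ and
$\mathbf{L}_{m+1} = \mathbf{D} ⓜ \mathbf{R}_m$. In particular, $\mathbf{R}_2 = \mathbf{R}$ and $\mathbf{L}_2 = \mathbf{L}$.

Sketch of proof. Proposition 2.1 shows that $\mathcal{R}_{m+1} \subseteq \mathcal{L}_m^{vdet}$, which is trivially
contained in $\mathcal{L}_m^{det}$. The last containment is proved algebraically, by showing that
if $\gamma\colon A^* \to M$ is an onto morphism, and $M \in \mathbf{K} ⓜ \mathbf{L}_m$, then for some large
enough $n$, $u \triangleright_{m+1,n} v$ implies $\gamma(u) = \gamma(v)$: thus $M$ is a quotient of
$A^*/\triangleright_{m+1,n}$ and hence, $M \in \mathbf{R}_{m+1}$. This proof relies on a technical property of
semigroups in DA: if $a \in A$ occurs in $\mathrm{alph}(v)$ and $\gamma(u) \mathrel{\mathcal{R}} \gamma(uv)$, then
$\gamma(uva) \mathrel{\mathcal{R}} \gamma(u)$. □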

It turns out that the $\mathbf{R}_m$ and the $\mathbf{L}_m$ were studied in the semigroup-theoretic
literature (Kufleitner, Trotter and Weil, [17,6]). In [6], it is defined as the
hierarchy of pseudovarieties obtained from $\mathbf{J}$ by repeated applications of the
operations $\mathbf{X} \mapsto \mathbf{K} ⓜ \mathbf{X}$ and $\mathbf{X} \mapsto \mathbf{D} ⓜ \mathbf{X}$. Proposition 2.5 shows that
it is the same hierarchy as that considered in this paper.⁴ The following results are
proved in [6, Section 4].

Proposition 2.6. The hierarchies $(\mathbf{R}_m)_m$ and $(\mathbf{L}_m)_m$ are infinite chains of
decidable pseudovarieties, and their unions are equal to DA. Moreover, every
$m$-generated monoid in DA lies in $\mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$.

The decidability statement in Proposition 2.6 is in fact a consequence of a
more precise statement (see [17,6]) which gives defining pseudoidentities for the
$\mathbf{R}_m$ and $\mathbf{L}_m$. Let $x_1, x_2, \ldots$ be a sequence of variables. If $u$ is a word on that
alphabet, we let $\bar{u}$ be the mirror image of $u$, that is, the word obtained from
reading $u$ from right to left. We let

$$G_2 = x_2 x_1, \quad I_2 = x_2 x_1 x_2,$$
$$\text{for } n > 2, \quad G_n = x_n \bar{G}_{n-1}, \quad I_n = G_n x_n \bar{I}_{n-1},$$
$$\varphi(x_1) = (x_1^\omega x_2^\omega x_1^\omega)^\omega, \quad \varphi(x_2) = x_2^\omega,$$
$$\text{and, for } n > 2, \quad \varphi(x_n) = (x_n^\omega \varphi(\bar{G}_{n-1} G_{n-1})^\omega x_n^\omega)^\omega.$$

Then we have [6]:

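⁴ More precisely, the pseudovarieties $\mathbf{R}_m$ and $\mathbf{L}_m$ in [6] are pseudovarieties of
semigroups, and the $\mathbf{R}_m$ and $\mathbf{L}_m$ considered in this paper are the classes of
monoids in these pseudovarieties.

Proposition 2.7. For each $m \ge 2$, $\mathbf{R}_m = \mathbf{DA} \cap [\![\varphi(G_m) = \varphi(I_m)]\!]$ and
$\mathbf{L}_m = \mathbf{DA} \cap [\![\varphi(\bar{G}_m) = \varphi(\bar{I}_m)]\!]$.

Example 2.8. For $\mathbf{R}_2$, this yields the pseudo-identity
$x_2^\omega (x_1^\omega x_2^\omega x_1^\omega)^\omega = x_2^\omega (x_1^\omega x_2^\omega x_1^\omega)^\omega x_2^\omega$.
One can verify that, together with the pseudo-identity defining DA, this is
equivalent to the usual pseudo-identity describing $\mathbf{R} = \mathbf{R}_2$, namely $(st)^\omega s = (st)^\omega$.
For $\mathbf{R}_3 = \mathbf{K} ⓜ \mathbf{L}$, no pseudo-identity was known in the literature. We get

$$\varphi(x_3)\,(x_1^\omega x_2^\omega x_1^\omega)^\omega x_2^\omega = \varphi(x_3)\,(x_1^\omega x_2^\omega x_1^\omega)^\omega x_2^\omega\,\varphi(x_3)\,x_2^\omega (x_1^\omega x_2^\omega x_1^\omega)^\omega x_2^\omega,$$

where $\varphi(x_3) = \big(x_3^\omega ((x_1^\omega x_2^\omega x_1^\omega)^\omega x_2^\omega (x_1^\omega x_2^\omega x_1^\omega)^\omega)^\omega x_3^\omega\big)^\omega$.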

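2.3 Connection with the $TL_{\underline{m}}$ and the $FO^2_m$ hierarchies

Proposition 2.3 established a containment between the $\mathbf{R}_m$ (resp. $\mathbf{L}_m$) and the
$\mathbf{TL}^X_{\underline{m}}$ (resp. $\mathbf{TL}^Y_{\underline{m}}$) hierarchies. A technical analysis allows us to prove
a containment in the other direction, but one that is not very tight, showing the
difference between the consideration of condensed rankers and that of ordinary
rankers.

Proposition 2.9. $\mathbf{R}_2 = \mathbf{TL}^X_{\underline{2}}$ and $\mathbf{L}_2 \subseteq \mathbf{TL}^Y_{\underline{2}}$. If $m \ge 3$ and if
two words agree on rankers in $R^X_{\underline{\lfloor 3m/2 \rfloor}}$ (resp. $R^Y_{\underline{\lfloor 3m/2 \rfloor}}$), then they
agree on condensed rankers in $R^X_{\underline{m}}$ (resp. $R^Y_{\underline{m}}$). In particular
$\mathbf{R}_m \subseteq \mathbf{TL}^X_{\underline{\lfloor 3m/2 \rfloor}}$ and $\mathbf{L}_m \subseteq \mathbf{TL}^Y_{\underline{\lfloor 3m/2 \rfloor}}$.

Example 2.10. The language $L_c(X_a Y_b X_c)$ is in $\mathcal{R}_3$ and not in $\mathcal{TL}^X_{\underline{3}}$.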

The connection between the $\mathbf{R}_m$, $\mathbf{L}_m$ and $FO^2_m$ hierarchies is tighter.

Theorem 2.11. Let $m \ge 1$. Every language in $\mathcal{R}_m$ or $\mathcal{L}_m$ is $FO^2_m$-definable,
and every $FO^2_m$-definable language is in $\mathcal{R}_{m+1} \cap \mathcal{L}_{m+1}$. Equivalently, we have

$$\mathbf{R}_m \vee \mathbf{L}_m \subseteq \mathbf{FO}^2_m \subseteq \mathbf{R}_{m+1} \cap \mathbf{L}_{m+1},$$

where $\mathbf{V} \vee \mathbf{W}$ denotes the least pseudovariety containing $\mathbf{V}$ and $\mathbf{W}$.

Sketch of proof. The containment $\mathbf{R}_m \vee \mathbf{L}_m \subseteq \mathbf{FO}^2_m$ follows directly from
Property (WI 1c) in Theorem 2.4. The proof of the converse containment also
relies on that theorem. We show that if $u \triangleright_{m+1,2n} v$ or $u \triangleleft_{m+1,2n} v$, then
Properties (WI 1c), (WI 2c) and (WI 3c) hold for $m$, $n$. This is done by a complex
and quite technical induction. □

If $m = 1$, we know that $\mathbf{R}_2 \cap \mathbf{L}_2 = \mathbf{R} \cap \mathbf{L} = \mathbf{J} = \mathbf{R}_1 \vee \mathbf{L}_1$: this
reflects the elementary observation that $FO^2_1$-definable languages, like
$FO_1$-definable languages, are the piecewise testable languages. For $m \ge 2$, we
conjecture that $\mathbf{R}_m \vee \mathbf{L}_m$ is properly contained in $\mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$. The
following shows it holds for $m = 2$.

Example 2.12. $L = \{b,c\}^* ca \{a,b\}^*$ is $FO^2_2$-definable, by the following formula:

$$\exists i\, \big(c(i) \wedge (\forall j\, (j < i \to \neg a(j))) \wedge (\forall j\, (j > i \to \neg c(j)))\big)$$
$$\wedge\ \exists i\, \big(a(i) \wedge (\forall j\, (j < i \to \neg a(j))) \wedge (\forall j\, (j > i \to \neg c(j)))\big)$$
$$\wedge\ \forall i\, \big(b(i) \to (\exists j\, (j < i \wedge a(j)) \vee \exists j\, (j > i \wedge c(j)))\big).$$

The words $u_n = (bc)^n (ab)^n$ are in $L$, while the words $v_n = (bc)^n b (ab)^n$ are not.
Almeida and Azevedo showed that $\mathbf{R}_2 \vee \mathbf{L}_2$ is defined by the pseudo-identity
$(bc)^\omega (ab)^\omega = (bc)^\omega b (ab)^\omega$ ([2, Theorem 9.2.13 and Exercise 9.2.15]). In particular,
for each language $K$ recognized by a monoid in $\mathbf{R}_2 \vee \mathbf{L}_2$, the words $u_n$ and $v_n$
(for $n$ large enough) are all in $K$, or all in the complement of $K$. Therefore $L$ is
not recognized by such a monoid, which proves that $\mathbf{R}_2 \vee \mathbf{L}_2$ is strictly contained
in $\mathbf{FO}^2_2$, and hence also in $\mathbf{R}_3 \cap \mathbf{L}_3$. It also shows that $\mathcal{TL}_{\underline{2}}$ is properly
contained in $\mathcal{FO}^2_2$.
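The membership claims of this example are easy to check mechanically. The sketch below tests the two families of words obtained by replacing $\omega$ with $n$ in the two sides of the pseudo-identity $(bc)^\omega (ab)^\omega = (bc)^\omega b (ab)^\omega$ against the regular expression for $L = \{b,c\}^* ca \{a,b\}^*$. The function names are hypothetical, and the check is only an illustration for small $n$, not a proof.

```python
import re

# L = {b,c}* c a {a,b}*, written as a regular expression.
L_PATTERN = re.compile(r"[bc]*ca[ab]*")


def in_L(word):
    """Membership of the whole word in L."""
    return L_PATTERN.fullmatch(word) is not None


def left_side(n):
    """The word (bc)^n (ab)^n."""
    return "bc" * n + "ab" * n


def right_side(n):
    """The word (bc)^n b (ab)^n."""
    return "bc" * n + "b" + "ab" * n


for n in range(1, 6):
    print(n, in_L(left_side(n)), in_L(right_side(n)))
```

Every word of the first family contains the factor `ca` at the junction of its two halves, while in the second family each `c` is followed by `b`, so the factor `ca` never occurs.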

Finally, we formulate the following conjecture. 
Conjecture 2.13. For each $m \ge 1$, $\mathbf{FO}^2_m = \mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$.

3 Consequences 

The main consequence we draw of Theorem 2.11 and of the decidability of the
pseudovarieties $\mathbf{R}_m$ and $\mathbf{L}_m$ is summarized in the next statement.

Theorem 3.1. Given an $FO^2$-definable language $L$, one can compute an integer
$m$ such that $L$ is $FO^2_{m+1}$-definable but not $FO^2_{m-1}$-definable. That is: we can
decide the quantifier alternation level of $L$ within one unit.

Sketch of proof. If $M \in$ DA, we can compute the largest $m$ such that
$M \notin \mathbf{R}_m \cap \mathbf{L}_m$ (Proposition 2.6). Then $M \in \mathbf{FO}^2_{m+1} \setminus \mathbf{FO}^2_{m-1}$ by
Theorem 2.11. □

The fact that the $\mathbf{R}_m$ and $\mathbf{L}_m$ form strict hierarchies (Proposition 2.6),
together with Theorem 2.11, proves that the $\mathcal{FO}^2_m$ hierarchy is infinite. Weis and
Immerman had already proved this result by combinatorial means [18], whereas
our proof is algebraic. From that result on the $\mathcal{FO}^2_m$, it is also possible to recover
the strict hierarchy result on the $\mathbf{R}_m$ and $\mathbf{L}_m$ and the fact that their union is
equal to DA.

By the same token, Propositions 2.3 and 2.9 show that the $\mathcal{TL}_{\underline{m}}$ (resp.
$\mathbf{TL}_{\underline{m}}$) hierarchy is infinite and that its union is all of $\mathcal{FO}^2$ (resp. DA).

Similarly, the fact that an $m$-generated element of DA lies in $\mathbf{R}_{m+1} \cap \mathbf{L}_{m+1}$
(Proposition 2.6) shows that an $FO^2$-definable language in $A^*$ lies in
$\mathcal{R}_{|A|+1} \cap \mathcal{L}_{|A|+1}$, and hence in $\mathcal{FO}^2_{|A|+1}$, a fact that was already
established by combinatorial means by Weis and Immerman [18, Theorem 4.6]. It also
shows that such a language is in $\mathcal{TL}_{\underline{\lfloor 3(|A|+1)/2 \rfloor}}$ by Proposition 2.9.

Finally we note the following refinement on [6, Proposition 4.6]. It was mentioned
in the introduction that the languages in $\mathcal{FO}^2$ are disjoint unions of
unambiguous products of the form $B_0^* a_1 B_1^* \cdots a_k B_k^*$, where each $B_i$ is a subset
of $A$. Propositions 2.5 and 2.6 imply the following statement.

Proposition 3.2. The least variety of languages containing the languages of
the form $B^*$ ($B \subseteq A$) and closed under visibly deterministic and visibly
co-deterministic products, is $\mathcal{FO}^2$.

Every unambiguous product of languages of the form $B_0^* a_1 B_1^* \cdots a_k B_k^*$ (with
each $B_i \subseteq A$), can be expressed in terms of the $B^*$ and the $a_i$ using only Boolean
operations and at most $|A| + 1$ applications of visibly deterministic and visibly
co-deterministic products, starting with a visibly deterministic (resp.
co-deterministic) product.

The weaker statement with the word visibly deleted was proved by the authors
in [6], as well as by Lodaya, Pandya and Shah [7].